1 Pen computing: a technology overview and a vision

Andre Meyer 

July 1995 ACM SIGCHI Bulletin, volume 27 issue 3 

Full text available: pdf(5.14 MB) Additional Information: full citation, abstract, citings, index terms

This work gives an overview of a new technology that is attracting growing interest in public 
as well as in the computer industry itself. The visible difference from other technologies is in
the use of a pen or pencil as the primary means of interaction between a user and a
machine, picking up the familiar pen and paper interface metaphor. From this follows a set
of consequences that will be analyzed and put into context with other emerging technologies 
and visions. Starting with a short historic ... 

2 Poster Sessions: Chinese string searching using the KMP algorithm

Robert W. P. Luk 

August 1996 Proceedings of the 16th conference on Computational linguistics - Volume 2

Full text available: pdf(397.79 KB) Additional Information: full citation, abstract, references

This paper is about the modification of KMP (Knuth, Morris and Pratt) algorithm for string 
searching of Chinese text. The difficulty is searching through a text string of single- and
multi-byte characters. We showed that proper decoding of the input as sequences of
characters instead of bytes is necessary. The standard KMP algorithm can easily be modified
for Chinese string searching but at the worst-case time-complexity of O(3n) in terms of the
number of comparisons. The finite-automaton ... 

3 A browsing tool of multi-lingual documents for users without multi-lingual fonts

Tetsuo Sakaguchi, Akira Maeda, Takehisa Fujita, Shigeo Sugimoto, Koichi Tabata 

April 1996 Proceedings of the first ACM international conference on Digital libraries 

Full text available: pdf(789.68 KB) Additional Information: full citation, references, citings, index terms






4 Learning evaluation functions to improve optimization by local search 

Justin Boyan, Andrew W. Moore 

September 2001 The Journal of Machine Learning Research, volume 1
Full text available: pdf(643.21 KB) Additional Information: full citation, abstract
This paper describes algorithms that learn to improve search performance on large-scale
optimization tasks. The main algorithm, STAGE, works by learning an evaluation function 
that predicts the outcome of a local search algorithm, such as hillclimbing or Walksat, from 
features of states visited during search. The learned evaluation function is then used to bias 
future search trajectories toward better optima on the same problem. Another algorithm,
X-STAGE, transfers previously learned evaluation ...

5 Fixed versus variable order Runge-Kutta 

L. F. Shampine, L. S. Baca

March 1986 ACM Transactions on Mathematical Software (TOMS), volume 12 issue 1 

Full text available: pdf(1.61 MB) Additional Information: full citation, abstract, references, index terms, review

Popular codes for the numerical solution of nonstiff ordinary differential equations (ODEs) 
are based on a (fixed order) Runge-Kutta method, a variable order Adams method, or an 
extrapolation method. Extrapolation can be viewed as a variable order Runge-Kutta method. 
It is plausible that variation of order could lead to a much more efficient Runge-Kutta code, 
but numerical comparisons have been contradictory. We reconcile previous comparisons by 
exposing differences in testing me ... 

6 Satchel: providing access to any document, any time, anywhere

Mik Lamming, Marge Eldridge, Mike Flynn, Chris Jones, David Pendlebury

September 2000 ACM Transactions on Computer-Human Interaction (TOCHI), volume 7 issue 3

Full text available: pdf(591.29 KB) Additional Information: full citation, abstract, references, citings, index terms

Current solutions for providing access to electronic documents while away from the office do 
not meet the special needs of mobile document workers. We describe "Satchel," a system 
that is designed specifically to support the distinctive features of mobile document work. 
Satchel is designed to meet the following five high-level design goals: (1) easy access to
document services; (2) timely document access; (3) streamlined user interface; (4)
ubiquity; and (5) compliance with securi ...

Keywords: document access, document appliance, document processing, information 
appliance, mobile computing, mobile work 



7 Japanese language handling in APL environments

Kyosuke Saigusa 

July 1998 ACM SIGAPL APL Quote Quad, Proceedings of the APL98 conference on Array processing language, volume 29 issue 3
Full text available: pdf(555.47 KB) Additional Information: full citation, abstract, references, index terms

While Japan is amongst the first countries to use APL back in the early 1970's and enjoyed 
at one time as high as 40% of IBM main-frame customers using APL, the lack of Japanese 
language support of APL in popular computing environments today such as UNIX/AIX, OS/2 
and Windows seems to be driving APL out of the scene. This paper tries to bring up this
subject more officially than I have done in the past for more public discussion not only for 
Japan, but also for those countries, where the characte ... 

8 Document creation I: Creating structured PDF files using XML templates

Matthew R. B. Hardy, David F. Brailsford, Peter L. Thomas

October 2004 Proceedings of the 2004 ACM symposium on Document engineering 

Full text available: pdf(166.87 KB) Additional Information: full citation, abstract, references, index terms

This paper describes a tool for recombining the logical structure from an XML document with 
the typeset appearance of the corresponding PDF document. The tool uses the XML
representation as a template for the insertion of the logical structure into the existing PDF
document thereby creating a Structured/Tagged PDF. The addition of logical structure adds
value to the PDF in three ways: the accessibility is improved (PDF screen readers for visually
impaired users perform better) media options a ...

Keywords: PDF, XML, logical structure insertion



9 Linguistic resource creation for research and technology development: A recent experiment

Stephanie Strassel, Mike Maxwell, Christopher Cieri 

June 2003 ACM Transactions on Asian Language Information Processing (TALIP), Volume 2 Issue 2

Full text available: pdf(186.39 KB) Additional Information: full citation, abstract, references, index terms

Advances in statistical machine learning encourage language-independent approaches to
linguistic technology development. Experiments in "porting" technologies to handle new 
natural languages have revealed a great potential for multilingual computing, but also a 
frustrating lack of linguistic resources for most languages. Recent efforts to address the lack 
of available resources have focused either on intensive resource development for a small 
number of languages or development of technologies fo ... 

Keywords: Cebuano, Hindi, Machine translation, crosslanguage, information extraction, 
information retrieval, language parsing and understanding, linguistic resources, machine 
translation, summarization, text analysis, translingual information access technology 



10 The use of computer-simulated radiances to retrieve total atmospheric ozone data from
satellite-mounted infrared multifilter radiometers

Patrick P. Weidhaas 

March 1981 Proceedings of the 14th annual symposium on Simulation 

Full text available: pdf(827.15 KB) Additional Information: full citation, abstract, references, index terms

Between March 1977 and February 1980, four satellites equipped with multichannel filter
radiometer (MFR) sensors supplied infrared radiance data at various spectral channels. The
Satellite Ozone Analysis Center (SOAC) at the Lawrence Livermore National Laboratory
received the radiance data from the U.S. Air Force. An ozone retrieval model was developed 
by SOAC in order to derive total ozone values from the radiance data. Statistical comparison 
against a network of ground observatories which ... 

11 A compression-based algorithm for Chinese word segmentation

W. J. Teahan, Rodger McNab, Yingying Wen, Ian H. Witten 
September 2000 Computational Linguistics, volume 26 issue 3 

Full text available: pdf(1.34 MB), Publisher Site Additional Information: full citation, abstract, references, citings

Chinese is written without using spaces or other word delimiters. Although a text may be
thought of as a corresponding sequence of words, there is considerable ambiguity in the
placement of boundaries. Interpreting a text as a sequence of words is beneficial for some
information retrieval and storage tasks: for example, fulltext search, word-based
compression, and keyphrase extraction. We describe a scheme that infers appropriate 
positions for word boundaries using an adaptive language model that ... 
October 2001 Proceedings of the 19th annual international conference on Computer documentation

Full text available: pdf(296.90 KB) Additional Information: full citation, abstract, citings, index terms

As more and more content is being produced and distributed in multiple formats, the issue 
of providing indexes for various formats becomes important. Indexes can be considered as 
residing within interfaces, rather than just more pages accompanying a printed piece, or a
tab stuck in an online help file. Designing indexes that work equally well in any of the various
interfaces a document may be displayed in, whether print, help, PDF, or on the Web, 
presents a real challenge. Understanding the struct ... 

Keywords: indexing, single-sourcing 



13 The Satchel system architecture: mobile access to documents and services

Mike Flynn, David Pendlebury, Chris Jones, Marge Eldridge, Mik Lamming 
December 2000 Mobile Networks and Applications, volume 5 issue 4 

Full text available: pdf(207.51 KB) Additional Information: full citation, abstract, references, citings, index terms

Mobile professionals require access to documents and document-related services, such
as printing, wherever they may be. They may also wish to give documents to colleagues
electronically, as easily as with paper, face-to-face, and with similar security
characteristics. The Satchel system provides such capabilities in the form of a mobile 
browser, implemented on a device that professional people would be likely to carry anyway, 
such as a pager or mobile phone. Printing may be per ... 

14 Poster session: Converting on-line bilingual dictionaries from human-readable to
machine-readable form

James Mayfield, Paul McNamee 

August 2002 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval

Full text available: pdf(201.99 KB) Additional Information: full citation, abstract, references, citings, index terms

We describe a language called ABET that allows rapid conversion of on-line human-readable 
bilingual dictionaries to machine-readable form. 

Keywords: bidicts, bilingual dictionaries, cross-language information retrieval, information 
extraction, scripting languages 



15 A comparison of Chinese document indexing strategies and retrieval models

Robert W. P. Luk, K. L. Kwok

September 2002 ACM Transactions on Asian Language Information Processing (TALIP), Volume 1 Issue 3

Full text available: pdf(419.42 KB) Additional Information: full citation, abstract, references, index terms



With the advent of the Internet and intranets, substantial interest is being shown in Asian 
language information retrieval; especially in Chinese, which is a good example of an Asian 
ideographic language (other examples include Japanese and Korean). Since, in this type of 
language, spaces do not delimit words, an important issue is which index terms should be 
extracted from documents. This issue also has wider implications for indexing other 
languages such as agglutinating languages (e.g., Finni ... 

Keywords: Chinese information retrieval, comparison, indexing strategies 
16 Papers: Identifying the coding system and language of on-line documents on the Internet

Gen-itiro Kikui

August 1996 Proceedings of the 16th conference on Computational linguistics - Volume 2

Full text available: pdf(523.04 KB) Additional Information: full citation, abstract, references



This paper proposes a new algorithm that simultaneously identifies the coding system and 
language of a code string fetched from the Internet, especially World-Wide Web. The 
algorithm uses statistic language models to select the correctly decoded string as well as to 
determine the language. The proposed algorithm covers 9 languages and 11 coding systems 
used in Eastern Asia and Western Europe. Experimental results show that the level of 
accuracy of our algorithm is over 95% for 640 on-line docume ...

17 Polyglot Emacs 20.4: A look at multilingual Emacs 

Jon Babcock 

March 1999 Linux Journal 

Full text available: html(23.41 KB) Additional Information: full citation, references, index terms



18 Adaptive Hindi OCR using generalized Hausdorff image comparison

Huanfeng Ma, David Doermann 

September 2003 ACM Transactions on Asian Language Information Processing (TALIP), Volume 2 Issue 3

Full text available: pdf(280.45 KB) Additional Information: full citation, abstract, references, index terms

We present an adaptive Hindi OCR implemented as part of a rapidly retargetable language
tool effort. The system includes: script identification, character segmentation, training
sample creation, and character recognition. In script identification, Hindi words are identified
from bilingual or multilingual documents based on features of the Devanagari script or using 
Support Vector Machines. Identified words are then segmented into individual characters in 
the next step, where the composite charac ... 

Keywords: Optical character recognition (OCR), document processing, generalized 
Hausdorff image comparison, script identification
Since an electronic book (eBook) has various advantages over a paper-based book, there is
a growing interest in eBooks. EBook industries are using different document formats
creating difficulties in the interoperability and interchangeability of eBooks. The U.S. and
Japan have realized the importance of a standard format for eBooks and developed such
standards. Recently, Korea has also developed a standard format based on XML. The
primary goal of this standard is to support the interchangeabilit ... 

Keywords: XML, a standard format, an electronic book, eBook 
A design represents a compromise between conflicting goals, and the design of the personal
computer of the year 2000 is no exception. We seek something that will fit comfortably into 
people's lives while dramatically changing them. This may appear to be a contradiction that 
cannot be reconciled. But if the technology does not fit easily into the habits and lifestyles of 
its human users, it will be discarded by those it was meant to help. And if this new tool does 
not change the life of its ... 